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(57) Abstract: The present invention describes a method for selecting a preferable codec mode for a connection between a first 
communication device (UE1) and a second communication device (UE2), where the devices communicate via a network. An element 
in the network observes a quality of said connection. The quality is based on a data error level of network resource. Based on 
the observation the network element judges a preferable operational mode of the codec, from a group of operational modes of the 
codec. The preferable codec mode is based on the quality level of the connection. In the preferred embodiment, if a Radio Network 
Controller (RNC) observes congestion by observing Real-Time Protocol (RTP) header information, the RNC judges a lower codec 
mode for the connection. The lower mode for the connection is judged if there is at least one gap in a sequence of detected packets. 



WO 03/019961 



PCT/FI02/00697 



1 

Selecting an operational mode of a codec 

This invention relates to a selection of an operational mode of a codec between 
communication devices where the communication devices communicate via a 
network. 

In wireless telecommunication systems information is transferred in an encoded 
form between a transmitting communication device and a receiving communication 
device. The transmitting communication device encodes original information into 
encoded information and sends it to the receiving communication device. The 
receiving communication device decodes the received encoded information in 
order to recreate the original information. The encoding and decoding is performed 
in codecs. Thus, the encoding is performed in a codec located in the transmitting 
communication device, and the decoding is performed in a codec located in the 
receiving communication device. However, since there are many different codecs 
available, the transmitting terminal and the receiving terminal have to agree upon 
the codec(s) to be used in a session. The selection of the codec takes place 
during the communication. 

The GSM (Global System for Mobile Communication) codec mode selection over 
air interface is described next. The codec mode related information, which is 
transmitted on each link, contains CMI (Codec Mode Indication(s)) and CMC 
(Codec Mode Command(s)) in the downlink, respectively CMI and CMR (Codec 
Mode Request(s)) in the uplink. The CMI informs the receiver about the currently 
applied codec mode. The CMC informs the other end about the codec mode to be 
applied on the other link. The CMR informs the other end about the preferred 
codec mode on the other link. In the GSM, the codec mode information is 
transmitted in the speech traffic channel, using a part of its transmission capacity. 
Codec modes are constrained to change only every second speech frame. The 
CMCs/CMRs and the CMIs are altered such that they occur only every second 
frame. For codec mode adaptation the receiving side performs link quality 
measurements of the incoming link. The measurements are processed yielding a 
Quality Indicator (Ql). For uplink (UL) adaptation, the Ql is directly fed into the UL 
mode control unit. This unit compares the Ql with certain thresholds and 
generates, also considering possible constraints from network control, the CMC 
indicating the codec mode to be used on the uplink. The CMC is then transmitted 
in the speech traffic channel to the mobile side where the incoming speech signal 
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is encoded in the corresponding codec mode. For downlink (DL) adaptation, the 
DL Mode Request Generator within the mobile compares the DL Quality indicator 
with certain thresholds and generates a CMR indicating the preferred codec mode 
for the DL. The CMR is transmitted in the speech traffic channel to the network 
side where it is fed into the DL Mode Control unit. This unit generally grants the 
requested mode. However, considering possible constraints from network control, 
it may also override the request. The resulting codec mode is then applied to 
encoding of the incoming speech signal in downlink direction. Both for uplink and 
downlink, the presently applied codec mode is transmitted in the speech channel 
as CM I together with the coded speech data. At the decoder, the CM I is decoded 
and applied for decoding of the received speech data. In both UL and DL, there is 
always a transcoder in the network. The transcoder causes delays in the 
communication. Disadvantageously, the codec mode selection is only based on 
the quality of the radio interface. 

The communication of the encoded information is critical for error free data 
communication in real-time applications such as a voice call. For example, in the 
voice call it is more preferable to use lower bit rate such as a lower codec mode 
with fewer errors than higher bit rate with larger number of errors. Generally, the 
communication of the real-time application uses lower bit rates with few data 
errors rather than high bit rates with data errors. The errors are due to packet 
losses or bit errors. Therefore, the selection of the codec is an important 
compromise between the data speed and QoS (Quality of Service). 

One solution to provide feedback on the quality of the data distribution is an 
additional companion protocol, RTCP (Real-Time Control Protocol) operating in 
VoIP (Voice over Internet Protocol) systems. The transmitting communication 
device can make use of RTCP information to adapt the applied encoding scheme 
to changes in the network load in order to improve service at the receiving 
communication device. This requires that the devices support the RTCP that is 
undesirable because the devices would require more processing power and 
memory. The increase in required processing power leads to higher power 
consumption which is undesirable in wireless user terminals operated by a battery. 
Because the RTCP information needs to be communicated in backward direction, 
the communication of the RTCP information reserves and reduces network 
capacity from the actual sen/ices. One solution to reduce the network capacity is a 
header removal technique which is a method where the RTCP can be separated 
from the actual data. Thus, the actual data stream runs separately from the RTCP 
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information. If the header removal is applied to the data stream, the RTCP needs 
to be run on a parallel PDP (Packet Data Protocol) context. However, the header 
removal technique requires additional mechanisms to link or create the removed 
header to the 'header removed data 1 . Therefore, a substantial associative problem 
5 to link the data and the header emerge when the header removal of the packet is 
used because a recognition whether a packet is the RTCP packet or not is very 
difficult. There are other packets with or without the header in the data stream. 
Thereby, substantial difficulties emerge again in the linking. There is a need to 
observe the quality of the entire connection between the transmitting and receiving 
10 communication device and based on the observed quality select a communication 
mode depending on the quality. 

According to a first aspect of the invention there is provided A method for 
selecting an operational mode of a codec of at least a first communication device, 
15 the method comprising: 

communicating from the first communication device information indicating 
from a group of operational modes of the codec that operational mode which the 
first communication device uses for a connection between the first and a second 
communication device where the first and the second communication device 
20 communicate via a network, wherein 

observing a quality level of the connection between the first and the second 
communication device in the network, and based on the observing 

selecting an operational mode of the codec depending on the quality level. 

25 Preferably, the quality level contains a data error rate of the connection. 
Accordingly, in the preferred embodiment the quality level is represented by a 
value indicating an inverse quality of the connection because if the quality level is 
high the actual quality of the connection is worse than normal. Also, if the quality 
level is low, the actual quality of the connection is better than normal. The quality 

30 level can be based on a number of packets lost per a number of packets in 
transmission in a network element. Thus, the quality level can be defined by 
means of the packet loss rate (PLR). This can be computed by the sequence 
number information as 

35 QL = 100 * Number of packets lost / Number of packets in transmission in a 

network element. 

The quality level can depend on for example, congestion in a packet based 
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network or weak coverage of the radio part of the network, thus both resulting in 
bit errors or packet lost. 

Preferably, the step of observing the quality level of the connection contains 
5 observing Real-Time Protocol (RTP) header information of the connection. 

In a more particular embodiment, the step of observing the quality level of the 
connection contains detecting at least one packet loss if there is at least one gap 
in a sequence of detected packets. The sequence is based on values in Real-Time 
10 Protocol (RTP) sequence number fields of the observed packets. Advantageously, 
a network element is able to observe the RTP header information of the 
connection and there is not required additional companion information about the 
quality of the connection. 

15 Advantageously, if the quality level exceeds a certain maximum criteria, the codec 
mode is changed to a lower codec mode than a requested or currently applied 
codec mode. If the quality level is below a certain minimum criteria, the codec 
mode is changed to a higher codec mode than the requested or currently applied 
codec mode. 

20 

Preferably, the communication devices comprise mobile communication devices 
operating in a third generation mobile network, and the network contains a radio 
access and an Internet Protocol (IP) based fixed core. 

25 Preferably, the operational mode of the codec contains an operational mode/bit 
rate of an AMR (Adaptive Multi-Rate) codec. 

In a further embodiment of the invention, the step of selecting the operational 
mode of the codec depending on the quality level contains selecting a particular 

30 codec from the group of operational modes of the codec. The system in the 
embodiment contains different codecs to be applied in the communication 
between the first and the second communication devices via the network. The first 
communication device receives a command from a network entity that the 
particular codec for the connection is to be used. Thus, the network entity has 

35 observed the quality level and based on the observing selected the particular 
codec for the connection. For example, the first and the second communication 
devices contain two codecs, a half rate codec and an Adaptive Multi-Rate (AMR) 
codec. The network entity selects the half rate codec by sending the command at 
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least to the first communication device because the network element has observed 
the reduced quality of the connection. 

According to a second aspect of the invention there is provided a first 
5 communication device for communicating encoded information to a second 
communication device via a network, the first communication device comprising: 

a transmitter for transmitting information to the second communication 
device via the network, the information indicating from a group of operational 
modes of a codec that operational mode which the first communication device 
10 uses for a connection between the first communication device and the second 
communication device, wherein the first communication device further comprises 

means for transmitting the information in a format which enables a network 
element to observe a quality level of the connection between the first and the 
second communication device, and 
15 means for receiving a command from the network element indicating an 

operational mode of the codec depending on the quality level, where the 
operational mode of the codec depending on the quality level is selected by the 
network element. 

20 Preferably, the communication devices are mobile communication devices 
operating in a third generation mobile communication network. 

According to a third aspect of the invention there is provided a system for 
communicating encoded information between a first communication device and a 
25 second communication device the system comprising the first communication 
device, the second communication device and a network, the first communication 
device comprising: 

a transmitter for transmitting information from the first communication 
device to the second communication device via the network, the information 
30 indicating from a group of operational modes of a codec that operational mode 
which the first communication device uses for a connection between the first 
communication device and the second communication device, wherein the 
network comprises 

a quality level observation unit for observing a quality level of the 
35 connection between the first and the second communication device, and based on 
the observing 

the quality level observation unit selects an operational mode of the codec 
depending on the quality level. 
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Preferably, the system comprises a third generation mobile communication 
system. 

5 According to another embodiment, the quality level unit operates in a conventional 
Radio Network Controller (RNC) of the network. 

According to a fourth aspect of the invention there is provided a computer 
program product for a network entity, the computer program product comprising: 

10 computer executable code for enabling the network entity to observe a 

quality level of a connection between a first communication device and a second 
communication device where the first and the second communication device 
communicate via a network over the connection, and where the network entity 
receives information indicating an operational mode of a codec from a group of 

15 operational modes of the codec that the first communication device uses for the 
connection between the first communication device and the second 
communication device, and 

computer executable code for enabling the network entity to select an 
operational mode of the codec depending on the quality level. 

20 

Preferably, the network entity contains at least one of a Radio Network Controller 
of a third generation mobile network, a Base Station Controller, an element in a 
General Packet Radio System (GPRS) Enhanced Radio Access Network 
(GERAN). 

25 

The invention will now be described, by way of example only, with reference to the 
accompanying drawings, in which: 

Figure 1 shows an example of a transport protocol format where the AMR packets 
30 are transmitted over the IP in accordance with the invention; 

Figure 2 shows an example of the RTP header information which is observed in 
accordance with the invention; 

Figure 3 shows an embodiment of the third generation telecommunication system 
where RTP header information is observed; 
35 Figure 4 shows a signalling diagram in changing the codec mode to a codec mode 
which is based on the quality level of the connection according to an embodiment 
of the invention; 
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Figure 5A shows in form of a flow chart a method for selecting the codec mode for 
downlink traffic according an embodiment of the invention; 

Figure 5B shows in form of a flow chart a method for selecting the codec mode for 
uplink traffic according to an embodiment of the invention; and 
5 Figure 6 shows a mobile station according to the invention. 

The present invention is generally directed to an apparatus and a method 
for selecting a preferable codec mode for a connection between a first 
communication device (UE1) and a second communication device (UE2), where 

1 0 the devices communicate via a network. An element in the network observes the 
quality level (QL) of said connection. The quality level (QL) is based on a data 
error level of network resource. Based on the observation the network element 
judges a preferable operational mode of the codec, from a group of operational 
modes of the codec. The preferable codec mode is based on the quality level of 

15 the connection. If the quality level (QL) of the connection is high, a lower codec 
mode than the requested or currently applied codec mode is selected. Also, if the 
quality level (QL) of the connection is low, a higher codec mode than the 
requested or currently applied codec mode is selected. The preferable codec 
mode may also be the currently applied codec. In the preferred embodiment, if a 

20 Radio Network Controller (RNC) observes congestion by observing Real-Time 
Protocol (RTP) header information, the RNC judges a lower codec mode than a 
requested or currently applied codec mode for the connection. The lower mode for 
the connection is judged if there is at least one gap in a sequence of detected 
packets. The lower mode for the connection can also be judged if there is a gap 

25 longer than a predefined threshold. In the preferred embodiment the UE1 is a 
wireless mobile station of a cellular radio network and the UE2 is another wireless 
mobile station of the same or another cellular radio network. An example of the 
cellular radio network is a wideband code division multiple access (WCDMA) 
network or another third generation network. The UE1 and/or the UE2 may also be 

30 fixed terminal operating in a fixed network. For example, the UE1 is an IP 
telephone coupled with an IP based network such as the Internet. 

Figure 1 shows an example of a transport protocol format where AMR 
(Adaptive Multi-Rate) packets are transmitted over the IP in accordance with the 
35 invention. The VoIP (Voice over Internet Protocol) is a term used in IP telephony 
for a set of facilities for managing the delivery of voice information using the 
Internet Protocol (IP) 100. In general, this means sending voice information in 
digital form in discrete packets rather than in the traditional circuit switched 
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protocols of a public switched telephone network (PSTN). Advantageously, the 
VoIP and Internet telephony provide means for reducing the costs of telephone 
service. The VoIP, now used somewhat generally, derives from the VoIP Forum, 
an effort by major equipment providers to promote the use of ITU-T's 
5 (Telecommunication Standardization Sector of the International 
Telecommunications Union) standard for sending voice (audio) and video using 
the IP on the Internet and within intranet. The Forum also promotes for example 
voice mail. 

10 In addition to the IP 100, the communication of voice data uses RTP (Real-Time 
Protocol) 104 to help to ensure that packets get delivered in a timely manner. 
Using the Internet or public networks, it is currently difficult to guarantee the QoS. 
The Real-time traffic in the IP network is typically carried using UDP (User 
Datagram Protocol) 102 which is a very lightweight protocol. As an example, the 

15 additional services that the UDP 102 provides are source and destination ports 
and an optional checksum that covers the UDP/IP header. The RTP 104 can be 
used above the UDP 102 to add end-to-end delivery services that are useful for 
real-time traffic. In particular, the RTP 104 provides transmitted datagrams / 
packets with sequence number, payload type identification and timestamping 

20 services. 

The RTP 104 is defined by the ITU (International Telecommunications Union). The 
RTP 104 is typically integrated into the application layer of the end-applications 
rather than being implemented as a separate layer in the communications 
software stack. The RTP 104 can be used in both unicast and multicast 

25 communication. An RTP session is defined by an IP address and a pair of UDP 
destination ports, one for RTP packets. In case of an audio-video conference the 
audio and video may use different sessions, for example, the same destination 
address but different port pairs. The RTP 104 is a flexible protocol and can be 
tailored to a particular application's needs by using for example profiles. Since 

30 each application uses only a single profile, no explicit indication of which profile is 
in use is necessary with the RTP 104. 

One applicable codec in the invention is the AMR (Adaptive Multi-Rate) speech 
codec. The AMR codec is developed by the ETSI (European Telecommunications 
35 Standards Institute). The AMR codec is standardized for GSM, and is also chosen 
by the Third Generation Partnership Project (3GPP) as the mandatory codec for 
third generation systems. The AMR codec will be widely used in various cellular 
systems. The AMR codec is a multi-mode codec with 8 narrow band speech codec 
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modes with bit rates 4.75, 5.15, 5.90, 6.70, 7.40, 7.95, 10.2 and 12.2 kbps, thus 
resulting in 8 different codec modes according to bit rate. The highest codec mode 
contains 12.2 kbps and the lowest codec mode contains 4.75 kbps. The sampling 
frequency is 8000 Hz and processing is done on 20 ms frames, for example, 160 
5 samples per frame. The AMR codec modes are closely related to each other and 
use the same coding framework. Three of the AMR codec modes are already 
adopted standards of their own, the 6.7 kbps mode as PDC-EFR (Personal Digital 
Communications-Enhanced Full Rate), the 7.4 kbps mode as IS-641 codec in 
TDMA (Time Division Multiple Access), and the 12.2 kbps mode as GSM-EFR. 

10 

Another applicable codec in the invention is the Adaptive Multi-Rate Wideband 
(AMR-WB) speech codec. The AMR-WB codec was originally developed by 3GPP 
(3G Partnership Project) to be used in GSM and 3G systems. The AMR-WB codec 
will be widely used in various cellular systems. The AMR-WB codec is a multi- 

15 mode speech codec with 9 wideband speech coding modes with bit-rates 6.6, 
8.85, 12.65, 14.25, 15.85, 18.25, 19.85, 23.05 and 23.85 kbps, thus resulting in 9 
different codec modes according to bit rate. The highest codec mode contains 
23.85 kbps and the lowest codec mode contains 6.6 kbps. The sampling 
frequency is 16000 Hz and processing is performed on 20 ms frames, i.e. 320 

20 speech samples per frame. The AMR-WB codec modes are closely related to 
each other and employ the same coding framework. Both codecs (AMR and AMR- 
WB) are applicable in the invention. 

In the embodiment of the invention, the multi-mode feature of the codec is used to 
25 preserve high speech quality under a wide range of transmission conditions. In 
mobile communication, the mode selection allows the system to adapt the balance 
between speech coding and error protection to enable the best possible speech 
quality in prevailing transmission conditions. The mode selection can also be 
utilized to adapt to the varying available transmission bandwidth. Advantageously, 
30 the codec(s) can handle mode changing at least to the next lower or upper mode 
at any time, for example, the bit rate is changed to a lower mode 12.65 kbps from 
a higher mode 14.25 kbps, if there is congestion in the network resource. The 
codec(s) can also handle mode changing to another mode. If the quality level (QL) 
of the connection is high i.e. the actual quality is poor, a lower codec mode than 
35 the requested or currently applied codec mode is selected. Also, if the quality level 
(QL) of the connection is low i.e. the actual quality is good, a higher codec mode 
than the requested or currently applied codec mode is selected. The mode 
information is transmitted together with the speech encoded bits, to indicate the 
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mode. For example, the mode information is attached to the RTP frame. 

Both codecs (AMR, AMR-WB) include voice activity detection (VAD) and 
generation of comfort noise (CN) parameters during silence periods. Thus, the 
5 codecs have the option to reduce the number of transmitted bits and packets 
during silence periods to a minimum. The operation to send CN parameters at 
regular intervals during silence periods is usually called discontinuous 
transmission (DTX) or source controlled rate (SCR) operation. The frames 
containing CN parameters are called Silence Indicator (SID) frames. 

10 

Conventionally, the codecs are suitable for circuit swithed cellular systems, but 
due to the flexibility and robustness of these codecs, they are suitable also for 
other preferred application. The preferred applications are real-time services over 
packet switched networks. Still referring to Figure 1, coded voice data (AMR) 106 

15 is transmitted on top of the RTP 104. The AMR format can be designed for 
robustness against both bit errors and packet loss. The preferred transport format 
of the voice data is AM R/RTP/U DP/IP as shown in the Figure 1. The speech 
encoded bits have different perceptual sensitivity to bit errors and alternatively 
cellular systems may exploit this by using unequal error protection and detection 

20 (UEP and UED). 

The payload format supports several means to increase robustness against packet 
loss. The simple scheme of repetition of previously sent data is one possibility. 
Another possible scheme which is more bandwidth efficient is to use payload 
25 external FEC (Forward Error Correction), which generates extra packets 
containing repair data. The whole payload can also be sorted in sensitivity order to 
support external FEC schemes using UEP. 

Several frames can be encapsulated into a single packet to decrease protocol 
30 overhead. One of the drawbacks of such an approach is that in case of packet loss 
this means loss of several consecutive speech frames, which usually causes 
clearly audible distortion in reconstructed speech. Interleaving of frames can 
improve the speech quality in such cases by distributing the consecutive losses 
into series of single frame losses. Interleaving and bundling several frames per 
35 payload will also increase end-to-end delay and is therefore not applicable to all 
types of applications. However, streaming applications are able to exploit 
interleaving to improve speech quality in lossy transmission conditions. 
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Figure 2 shows an example of RTP header information which is observed in 
the RNC (Radio Network Controller) in accordance with the invention. The UE1 (or 
UE2 respectively) increments a sequence number field 200 (16 bits) by 1 each 
time it sends an RTP packet. Alternatively, the RNC1 (or RNC2 respectively) can 
increment the sequence number field 200 by 1 each time it sends an RTP packet. 
The increment enables the RNC2 (or the RNC1 respectively) to restore the initial 
packet sequence in the event of any reordering that may have occurred during 
transmission through the network. Alternatively, the UE1 (or the UE2 respectively) 
can restore the packet sequence. Advantageously, in addition to the reordering, 
the RNC2 (or the RNC1 respectively) can detect packet loss if there is at least one 
gap in the sequence of received packets. The RNC can observe the quality of the 
connection such as the congestion of the network or weak network coverage and 
determine the reduced QoS if there is at least one gap in the sequence of received 
packets. There may also be several gaps in the sequence of received packets. For 
example, if there has been previously received packets with sequence numbers 1 , 
2, 3 and then received packets 7 and 8, the receiving device can detect the 
reduced quality. Advantageously, in case the quality level is below a certain 
threshold, the RNC is able to select the lower codec mode for the connection in 
order to improve the QoS. The initial value of the sequence number is chosen at 
random. A timestamp field 202 (32 bits) reflects the sampling instant of the RTP 
packet and allows the RNC2 or UE2 (RNC1 or UE1 respectively) to ensure that 
each packet is "played" at the correct point in time to ensure that the original timing 
relationship between the samples is maintained. Alternatively, in the case of 
sources that generate RTP packets periodically, for example fixed audio, the 
timestamp does not explicitly represent an absolute time. Instead, it would be 
incremented by one at each sampling interval. 

In addition to the sequence number field 200 and the timestamp field 202, the RTP 
header information may comprise the following fields. A version field 204 (1 bit) 
identifies the version of the RTP 104. If a padding field 206 (1 bit) is set, it 
indicates that the packet contains one or more octets at the end which are not part 
of the payload. For example, in the case if certain encryption algorithms are being 
used. If the value of the padding field = 1 , then the last octet of the packet or 
header indicates how many octets of padding there are in total. If an extension 
field 208 (1 bit) is set, it indicates that the fixed header is followed by a single 
variable length header extension containing additional fields. The third and fourth 
octets of this header extension indicate its length. Thus, applications not 
supporting this header extension can ignore it. This kind of header extension may 
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be used as a temporary measure to allow individual implementations to 
experiment with new payload-format independent functions. If the assessment of 
the new functions proved favourable then they could be captured in a new profile 
specification. A CSRC (contributing source) count field 210 (4 bits) is the number 

5 of CSRC identifiers that follow the fixed header. A marker field 212 (1 bit) allows 
significant points to be marked in the traffic stream. For example, frame 
boundaries can be marked by the marker field 212. A payload type field 214 (7 
bits) identifies the format of the RTP payload. A SSRC (Synchronisation source) 
field 216 (32 bits) identifies the synchronisation source of a group of packets. It is 

10 a randomly chosen value meant to be globally unique within a particular RTP 
session. All packets from the same synchronisation source form a part of the same 
timing and sequence number. An example of a synchronisation source is a 
microphone or a mixer. A CSRC (Contributing Source) list 218 is required when 
mixing is performed. A mixer accepts packets from one or more synchronisation 

15 sources, possibly changes the data format and combines the packets in some 
manner to form a combined RTP packet to be forwarded. The SSRC of the 
combined packet will be equal to that of the mixer while the CSRC list identifies all 
of the original SSRCs that contributed to the combined stream. 

20 Figure 3 shows an embodiment of the third generation telecommunication 

system where the RTP header information is observed. UE1 (first communication 
device) 300 is coupled with RNC1 (first Radio Network Controller) 302 via a radio 
path and a base station. The RNC1 302 comprises a Quality Level (QL) 
observation unit 304. The QL observation unit 304 observes a bit error / a packet 

25 loss rate of the network based on the RTP header information and accordingly the 
RNC (302 or 308) decides whether to change the codec mode of the connection. 
The QL observation unit 304 is a computing software running in a conventional 
RNC. Alternatively, the QL observation unit 304 can be hardware or middleware 
implemented by, for example, a Digital Signalling Processor(s) (DSP). The RNC1 

30 302 communicates with RNC2 (second Radio Network Controller) 308 via a core 
network 306. The core network 306 enables the transfer of IP based data 
information. Moreover, the UE1 300 and the RNC1 (UE2 312 and the RNC2 308 
respectively) can communicate by the IP based data information. The RNC2 308 
comprises the QL observation unit 310 which is equivalent to the QL unit 304 but 

35 operating under the RNC2 308. The RNC2 308 is coupled with the UE2 (second 
communication device) 312 via a radio path. 
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In the embodiment of Figure 4 data traffic relating to the communication 
between the uplink and the downlink can be transferred via the same path. The 
codec related information, request and command (CMI, CMR, CMC) are 
applicable to run in the cellular radio network. 

5 

Figure 4 shows a signalling diagram in changing the applied codec mode to 
the codec mode which depends on the quality level of the connection according to 
an embodiment of the invention. The UE1 sends the information CMI and a 
request for codec mode 12.2 kbps (CMR,) to the RNC1. The actual voice data 

10 communication in accordance with the CMR t and the CMI is designed to the UE2. 
The RNC1 receives and forwards the CMI and the CMR, to the RNC2. The RNC2 
receives the CMI and the CMR, . The RNC2 observes the network traffic and is 
able to study the data stream as referred to the examples of Figures 1 , 2 and 3. If 
the RNC2 detects, based on studying, that a substantial amount of the packets are 

15 missing, the RNC2 creates and stores a new lower value, for example, 10.2 kbps 
for the request CMR, . The substantial amount of missing packets is an adjustable 

amount of the packets depending on the required QoS set by a network operator. 
The RNC2 creates the command CMC indicating the applied codec mode 10.2 
kbps based on the CMR,. The RNC2 transmits the CMC and the information CMI 

20 to the UE2. The UE2 receives the CMC and the CMI and sends a request for 
codec mode 7.95 kbps (CMR 2 ) to the RNC2. The UE2 sends the request for 

codec mode 7.95 kbps because the UE2 has detected reduced quality for the 
radio path between the RNC2 and UE2. The RNC2 receives the CMR 2 . The 
RNC2 compares the stored 10.2 kbps CMR, to the 7.95 kbps CMR 2 . If the value 

25 of the CMR 2 is smaller than the value of the CMR,, the applied codec mode is 
selected to be the lower 7.95 kbps CMR 2 . If the value of the CMR, had been 
smaller than the value of the CMR 2 , the applied codec mode would have been 
selected to be the lower CMR, instead of CMR 2 . Advantageously, the applied 
codec mode can be contained in the one of the RTP or the AMR fields. The codec 

30 requests can be contained in a packet(s) or frame(s), and/or they can be 
contained in transmission protocol field, for example in the RTP or the AMR field. 
In the example of figure 4, if the UE2 had not detected the reduced quality in the 
radio path, the RNC2 would have selected the codec mode 10.2 kbps. 
Advantageously, the codec mode selection is based on the QoS of the actual 

35 network, the network comprising both the radio access and the core network. 

If the flow of speech data is from UE2 to UE1 , the RNC1 has the essential role in 
the codec mode selection process, and advantageously the communication 
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between the UE2 and UE1 may take place in accordance with the preferable 
communication mode. 

In Figures 5A and 5B a flow of speech data is considered to be: UE1- 
5 >RNC1->RNC2->UE2. The inverse data flow, UE2->RNC2->RNC1->UE1, is also 
applicable and operates inversely. 

Figure 5A shows a method for selecting the codec mode for downlink traffic 
according to an embodiment of the invention. The RNC2 receives packets as the 

10 intended communication between UE1 and UE2 takes place (Step 500). The 
RNC2 makes Quality Level (QL) measurements (Step 502) and is able to detect 
packet losses based on the data communication between the UE1 and the RNC2 
including, but not limited, communication over the radio interface and the core 
network. The RNC2 observes the network traffic and is able to study the data 

15 stream as referred to the examples of Figures 1, 2 and 3. In condition 504, based 
on the measurements if the QL is greater than a maximum threshold 
(MaxThreshold), the RNC2 selects a lower codec mode than the requested or 
currently applied codec mode (Step 506). The RNC2 selects the lower codec 
mode by sending a request indicating lower codec mode (CMR LO ) to the RNC1. 

20 For example, the RNC2 has received a codec mode request for 23.05 kbps codec 
mode, and the RNC2 observes reduced quality and changes the codec mode 
request to the 19.85 kbps CMR W . The selection of the lower codec mode 
comprises also sending the command CMC LO to the UE1 . This can be a separate 
message or contained in the request CMR^ which is sent to the RNC1. In the 

25 condition 504, if the QL is not greater than the Maxthreshold, the RNC2 performs 
another condition 508. In the condition 508, if the QL is smaller than a minimum 
threshold (MinThreshold), the RNC2 selects a higher codec mode than the 
requested or currently applied codec mode (Step 510). The RNC2 selects the 
higher codec mode by sending a request indicating higher codec mode (CMR H/ ) 

30 to the RNC1 . For example, The RNC2 has received a codec mode request for 
15.85 kbps, and the RNC2 observes that the QL is smaller than the MinThreshold 
and changes the codec mode to the 18.25 kbps CMR^. The selection of the 
higher codec mode comprises also sending the command CMC HI to the UE1. The 
CMC^ intended to the UE1 can be a separate message or contained in the 

35 request CMR m which is sent to the RNC1 . In the condition 508, if the QL is not 
smaller than MinTheshold, the codec mode is not changed and the process ends 
(Step 512). 
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Figure 5B shows a method for selecting the codec mode for uplink traffic 
according to an embodiment of the invention. The RNC1 receives packets as the 
intended communication between the UE1 and the UE2 takes place (Step 514). 
The RNC1 makes Quality Level (QL) measurements (Step 516) and is able to 
detect packet losses based on the communication between the UE1 and the 
RNC1 including, but not limited, communication over the radio interface and the 
core network. The RNC1 observes the data traffic and is able to study the data 
stream as referred to the examples of Figures 1, 2 and 3. In condition 518, based 
on the measurements if the QL is greater than a maximum threshold 
(MaxThreshold), the RNC1 selects a lower codec mode than the requested or 
currently applied codec mode (Step 520). The RNC1 selects the lower codec 
mode by sending a command indicating lower codec mode (CMC i0 ) to the UE1. 

For example, the RNC1 has received a codec mode request for 12.2 kbps codec 
mode, and the RNC1 observes congestion and changes the codec mode to the 
10.2 kbps CMC i0 . Thus, the CMC i0 indicates lower codec mode than the codec 

mode which is initially received or currently applied at the RNC1. In the condition 
518, if the QL is not greater than the Maxthreshold, the RNC1 performs another 
condition 522. In the condition 522, if the QL is smaller than a Minimum Threshold 
(MinThreshold), the RNC1 selects a higher codec mode than the requested or 
currently applied codec mode (Step 524). The RNC1 selects the higher codec 
mode by sending a command indicating higher codec mode (CMC H/ ) to the UE1. 
For example, the RNC1 has received a codec mode request for 6.70 kbps codec 
mode, and the RNC1 observes that the QL is smaller than the MinThreshold and 
changes the codec mode to the 7.40 kbps CMC W . In the condition 522, if the QL 
is not smaller than MinTheshold, the codec mode is not changed and the process 
ends (Step 526). 

The minimum threshold (MinThreshold) indicates respectively the minimum quality 
level below which it is allowed to switch to a higher codec mode. The maximum 
quality threshold (MaxThreshold) respectively indicates the maximum quality level 
above which it is recommended to switch to a lower codec mode by the network 
element such as the RNC. The distance between the MinThreshold and the 
MaxThreshold determines the sensitivity of the process. The MinThreshold and 
the MaxThreshold can be defined by the network operator. The quality of the 
connection in packet based network, comprising at least the radio path and the 
core part, can depend generally on two factors: 1) The number of packets in a 
queue and 2) the size of the packets in the queue. For quality measurement RNCs 
do not make use of the RTCP information, but the RTP header information. In 
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particular, the RNCs make use of the sequence number field to check if the natural 
sequence spaced by one unit has been broken or interrupted, due to one or more 
packet lost. 

The QL is defined by means of the packet loss rate (PLR). This can be computed 
by the sequence number information as 

QL = 100 * Number of packets lost / Number of packets in transmission in 

RNC. 

Because of the nature of the speech, it may be useless to keep all the history of 
packet losses since the beginning of the flow. For example, it is not important if the 
speech quality was bad 1 minute ago, if it is good now. The QL measurement can 
be restricted to the latest period of time T, where T can be defined by the operator. 
In synthesis, points of time to determine the QL are the following. 1) When the 
codec (AMR) mode is changed. These operations happen when the traffic 
checked in RNC is really transmitted with a new codec (AMR) mode. 2) During a 
normal operation if time > T. Thus, if the time index T is exceeded. 

Another variable that is defined by the network operator is the amount of time 
needed before the codec (AMR) mode can be changed. For example, it could be 
unsuitable to react too quickly when a packet loss occurs, but instead it would be 
better to change mode if the lossy condition persists for 1 or 2 seconds. This time 
can be defined by a variable R (reaction time). 

A transcoding enables a usage of different codec mode between different 
elements having different codecs in the network. For example, between the MS 
and the PSTN in the GSM, where the MSC (Mobile Switching Centre) contains the 
transcoding. The transcoding further enables a usage of different codec (AMR) 
mode between the UE1 and the UE2 when the voice call is active. Preferably, the 
codec (AMR) modes should be the same in the UE1 and in the UE2 when the 
voice call is activated, but the core network can adapt the communication by 
transcoding if the codec (AMR) modes of the UE1 and the UE2 are different. 
Disadvantageously, the transcoding creates delays in the communication. 

Figure 6 shows a cellular mobile station 600 according to the invention. The 
mobile station 600 shown operates as the UE1. A corresponding mobile station 
may operate as the UE2. The mobile station 600 comprises a processing unit 
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CPU, a radio frequency part RF and a user interface Ul. The radio frequency part 
RF and the user interface Ul are coupled to the processing unit CPU. The user 
interface Ul comprises a display and a keyboard (not shown) to enable a user to 
use the mobile station 700. In addition, the user interface Ul comprises a 
microphone and a speaker for receiving and producing audio signals. The user 
interface Ul may also comprise voice recognition (not shown). The processing unit 
CPU comprises a microprocessor (not shown), memory MEM and software SW. 
The software SW is stored in the memory MEM. The microprocessor controls, on 
the basis of the software SW, the operation of the mobile station 600, such as the 
use of the radio frequency part RF and the presenting of information in the user 
interface Ul and the reading of inputs received from the user interface Ul. The 
software SW comprises a WCDMA protocol stack on the basis of which a 
transmitter (not shown) of the radio frequency part RF transmits and a receiver 
(not shown) of the radio frequency part RF receives messages and other 
information with the aid of its antenna ANT. The codecs, the selection of which is 
negotiated, reside in the mobile station 600. They may be implemented in the 
software SW. Another alternative is hardware implementation of the codecs (not 
shown). 

Particular implementations and embodiments of the invention have been 
described. It is clear to a person skilled in the art that the invention is not restricted 
to details of the embodiments presented above, but that it can be implemented in 
other embodiments using equivalent means without deviating from the 
characteristics of the invention. The scope of the invention is only restricted by the 
attached patent claims. For example, the observation can be done in a unit in the 
edge of the 3 rd generation network with an assumption that operator controlled 3 rd 
generation network does not substantially lose packets. For another example, the 
observation can be done in a Base Station Controller. For another example, the 
observation can be done in an element in a General Packet Radio System (GPRS) 
Enhanced Radio Access Network (GERAN). 



WO 03/019961 



PCT/FI02/00697 



18 

Claims 

1. A method for selecting an operational mode of a codec of at least a first 
communication device (UE1), the method comprising: 

communicating from the first communication device (UE1) 
information indicating from a group of operational modes of the codec that 
operational mode which the first communication device (UE1) uses for a 
connection between the first (UE1) and a second communication device 
(UE2) where the first and the second communication device (UE1, UE2) 
communicate via a network, characterised by 

observing a quality level (QL) of the connection between the first and 
the second communication device (UE1 , UE2) in the network, and based on 
the observing 

selecting an operational mode of the codec depending on the quality 
level (QL). 

2. A method according to claim 1, wherein the step of observing the quality 
level of the connection comprises detecting at least one packet loss if there 
is at least one gap in a sequence of detected packets, wherein the 
sequence is based values in Real-Time Protocol (RTP) sequence number 
fields (200). 

3. A method according to claim 1, wherein the step of observing the quality 
level of the connection comprises observing Real-Time Protocol (RTP) 
header information of the connection. 

4. A method according to claiml, wherein an element (302, 308) of the 
network observes the quality level (QL). 

5. A method according to claim 1, wherein the quality level (QL) comprises a 
data error rate of the connection. 

6. A method according to claim 1, wherein the quality level (QL) is based on a 
number of packets lost per a number of packets in transmission in a 
network element. 

7. A method according to claim 1 , 2 or 3, wherein the step of selecting an 
operational mode of the codec comprises selecting a lower codec mode if 
the quality level is above a certain maximum level. 
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8. A method according to claim 1, 2 or 3, wherein the step of selecting an 
operational mode of the codec comprises selecting a higher codec mode if 
the quality level is below a certain minimum level. 

9. A method according to claim 2, wherein the operational mode of the codec 
depending on the quality level (QL) is selected if a certain amount of packet 
loss is observed. 

10. A method according to claim 1, wherein an element (302, 308) of the 
network selects the operational mode of the codec depending on the quality 
level (QL) by sending a command indicating the operational mode of the 
codec depending on the quality level (QL) at least to the first 
communication device (UE1). 

1 1. A method according to any of the preceding claims, wherein at least one of 
the communication devices (UE1, UE2) comprises a mobile communication 
device. 

12. A method according to claim 1, wherein the network comprises a radio 
access network and a fixed core network. 

13. A method according to any of the preceding claims, wherein the operational 
mode of the codec comprises an operational mode of an Adaptive Multi- 
Rate (AMR) codec. 

14. A method according to claim 13, wherein the operation mode of the 
Adaptive Multi-Rate (AMR) codec comprises a bit rate of the Adaptive Multi- 
Rate (AMR) codec. 

15. A method according to claiml, wherein the step of selecting the operational 
mode of the codec depending on the quality level (QL) comprises selecting 
a particular codec from the group of operational modes of the codec. 

16. A first communication device (UE1) for communicating encoded information 
to a second communication device (UE2) via a network, the first 
communication device (UE1) comprising: 

a transmitter (RF) for transmitting information to the second 
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communication device (UE2) via the network, the information indicating 
from a group of operational modes of a codec that operational mode which 
the first communication device (UE1) uses for a connection between the 
first communication device (UE1) and the second communication device 
5 (UE2), characterised in that the first communication device (UE1) further 

comprises 

means for transmitting the information in a format which enables a 
network element (302, 308) to observe a quality level (QL) of the 
connection between the first and the second communication device (UE1, 
10 UE2), and 

means for receiving a command from the network element (302, 
308) indicating an operational mode of the codec depending on the quality 
level (QL), where the operational mode of the codec depending on the 
quality level (QL) is selected by the network element (302, 308). 

15 

17. A system for communicating encoded information between a first 
communication device (UE1) and a second communication device (UE2) 
the system comprising the first communication device (UE1), the second 
communication device (UE1) and a network, the first communication device 

20 (UE1) comprising: 

a transmitter (RF) for transmitting information from the first 
communication device (UE1) to the second communication device (UE2) 
via the network, the information indicating from a group of operational 
modes of a codec that operational mode which the first communication 
25 device (UE1) uses for a connection between the first communication device 

(UE1) and the second communication device (UE2), characterised in that 
the network comprises 

a quality level observation unit (304, 310) for observing a quality 
level (QL) of the connection between the first and the second 
30 communication device (UE1 , UE2), and based on the observing 

the quality level observation unit (304, 310) selects an operational 
mode of the codec depending on the quality level (QL). 

18. A computer program product for a network entity (302, 308), the computer 
35 program product comprising: 

computer executable code for enabling the network entity (302, 308) 
to observe a quality level (QL) of a connection between a first 
communication device (UE1) and a second communication device (UE2) 
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where the first and the second communication device (UE1, UE2) 
communicate via a network over the connection, and where the network 
entity (302, 308) receives information indicating an operational mode of a 
codec from a group of operational modes of the codec that the first 
communication device (UE1) uses for the connection between the first 
communication device (UE1) and the second communication device (UE2), 
and 

computer executable code for enabling the network entity (302, 308) 
to select an operational mode of the codec depending on the quality level 
W- 

19. A computer program product according to claim 18, wherein the network 
entity (302, 308) comprises at least one of a Radio Network Controller of a 
third generation mobile network, a Base Station Controller and an element 
in a General Packet Radio System (GPRS) Enhanced Radio Access 
Network (GERAN). 
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